The Disambiguation of Nominalisations
نویسنده
چکیده
This paper addresses the interpretation of nominalisations, a particular class of compound nouns whose head noun is derived from a verb and whose modifier is interpreted as an argument of this verb. Any attempt to automatically interpret nominalisations needs to take into account: (a) the selectional constraints imposed by the nominalised compound head, (b) the fact that the relation of the modifier and the head noun can be ambiguous, and (c) the fact that these constraints can be easily overridden by contextual or pragmatic factors. The interpretation of nominalisations poses a further challenge for probabilistic approaches since the argument relations between a head and its modifier are not readily available in the corpus. Even an approximation which maps the compound head to its underlying verb provides insufficient evidence. We present an approach which treats the interpretation task as a disambiguation problem and show how we can “recreate” the missing distributional evidence by exploiting partial parsing, smoothing techniques, and contextual information. We combine these distinct information sources using Ripper, a system that learns sets of rules from data, and achieve an accuracy of 86:1% (over a baseline of 61:5%) on the British National Corpus.
منابع مشابه
Automatic sortal Interpretation of German Nominalisations with -ung Towards using underspecified Representations in Corpora
In this paper we present work on using dependency structures in a process of automatic sortal interpretation of German nominalisations with -ung, such as Messung (‘measurement’) or Zählung (‘count’). Many such -ung nominalisations are ambiguous with respect to their sortal interpretation (cf. Ehrich and Rapp (2000) who lean heavily on McCawley (1968) and Lakoff (1972) for the notion of sortal a...
متن کاملSemantic Role Assignment for Event Nominalisations by Leveraging Verbal Data
This paper presents a novel approach to the task of semantic role labelling for event nominalisations, which make up a considerable fraction of predicates in running text, but are underrepresented in terms of training data and difficult to model. We propose to address this situation by data expansion. We construct a model for nominal role labelling solely from verbal training data. The best qua...
متن کاملStatistical Interpretation of Compound Nominalisations
This paper presents a method for detecting compound nominalisations from open data, and providing a semantic intepretation. It uses a statistical model based on confidence intervals over frequencies extracted from a large, balanced corpus. Using three paraphrases of the given compound nominalisation, and interpretation preferences of its components, the algorithm achieves about 70% accuracy in ...
متن کاملبهبود صحت ابهامزدایی نام نویسنده با استفاده از خوشهبندی تجمّعی
Today, digital libraries are important academic resources including millions of citations and bibliographic essential information such as titles, author's names and location of publications. From the view of knowledge accumulation management, the ability to search fast, accurate, desired contents, has a great importance. The complexity and similarity in these resources cause many challenges and...
متن کامل